Hybrid Reinforcement Learning and Its Application to Biped Robot Control

نویسندگان

Satoshi Yamada

Akira Watanabe

Michio Nakashima

چکیده

A learning system composed of linear control modules, reinforcement learning modules and selection modules (a hybrid reinforcement learning system) is proposed for the fast learning of real-world control problems. The selection modules choose one appropriate control module dependent on the state. This hybrid learning system was applied to the control of a stilt-type biped robot. It learned the control on a sloped floor more quickly than the usual reinforcement learning because it did not need to learn the control on a flat floor, where the linear control module can control the robot. When it was trained by a 2-step learning (during the first learning step, the selection module was trained by a training procedure controlled only by the linear controller), it learned the control more quickly. The average number of trials (about 50) is so small that the learning system is applicable to real robot control.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Episodic Reinforcement Learning Control Approach for Biped Walking

This paper presents a hybrid dynamic control approach to the realisation of humanoid biped robotic walk, focusing on the policy gradient episodic reinforcement learning with fuzzy evaluative feedback. The proposed structure of controller involves two feedback loops: a conventional computed torque controller and an episodic reinforcement learning controller. The reinforcement learning part inclu...

متن کامل

Biped Balance Control by Reinforcement Learning

This work studied biped walking with single (one-leg) support and balance control using reinforcement learning. The proposed Q-learning algorithm makes a robot learn to walk without any previous knowledge of dynamics model. This balance control with single support shifts the Zero Moment Point (ZMP) of the robot to a stable region over walking sequences by means of learned gestures. Hence, the p...

متن کامل

Reinforcement Learning for a CPG-driven Biped Robot

Animal’s rhythmic movements such as locomotion are considered to be controlled by neural circuits called central pattern generators (CPGs). This article presents a reinforcement learning (RL) method for a CPG controller, which is inspired by the control mechanism of animals. Because the CPG controller is an instance of recurrent neural networks, a naive application of RL involves difficulties. ...

متن کامل

Dynamic Control Algorithm for Biped Walking Based on Policy Gradient Fuzzy Reinforcement Learning

This paper presents a novel dynamic control approach to acquire biped walking of humanoid robots focussed on policy gradient reinforcement learning with fuzzy evaluative feedback . The proposed structure of controller involves two feedback loops: conventional computed torque controller including impact-force controller and reinforcement learning computed torque controller. Reinforcement learnin...

متن کامل